|
|
Accession Number |
TCMCG041C18591 |
gbkey |
CDS |
Protein Id |
XP_019054221.1 |
Location |
complement(join(1372307..1372531,1372646..1373365,1374737..1375036,1379892..1379949,1387900..1388043,1388158..1388414,1391690..1391896,1392539..1392712,1393659..1393726,1396409..1396467,1399372..1399447,1399534..1399639,1399738..1399859,1406769..1406933,1409443..1409528,1417395..1417459,1430675..1430722,1440624..1440743,1441544..1441619,1443880..1443977,1444102..1444240,1444703..1444794)) |
Gene |
LOC104602754 |
GeneID |
104602754 |
Organism |
Nelumbo nucifera |
|
|
Length |
1134aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA264089 |
db_source |
XM_019198676.1
|
Definition |
PREDICTED: DNA mismatch repair protein MSH1, mitochondrial isoform X4 [Nelumbo nucifera] |
CDS: ATGAGTTGGTGTCTGAAGTGGATGAGACATGATCGAGTGGGCGAGCTTTGTTTATGGGTGCTGAAAGTTGTCCAGAACATGGATGCAGCATGCAGAGGACAAGGTGATTGGATCCATTGTTTCAAGACGGAGAGGCTTTCGAGGGGAAATGTGAAAGCAACTAAAAAACTGAAGGAAGCAAAACCTATTCTAGAAGAAAAAGATCATTCTCATATAATGTGGTGGAAAGAGAGATTACAGTTTTTCAGAAAGCCTTCTTCCATCCAACTGGTTAAACGGCTTACCTATTCAAATTTGCTAGGTGTAGATGACAGCCTGAAAAATGGGAGTTTGAAAGAGGGTACACTCAACTGGGAGATGCTGCAATTTAAGATGAGGTTTCCACGTGAAGTTTTACTATGTAGAGTTGGGGATTTTTATGAAGCTATTGGTATTGATGCGTGTGTTCTTGTTGAGCATGCTGGTTTGAATCCCTTTGGTGGTTTGCGTTCAGATAGTATTCCAAGAGCAGGATGCCCTGTTATGAACTTGCGACAAACCTTGGATGATTTGACACGCAATGGATATTCAGTCTGCATAGTTGAGGAAGTTCAAGGTCCAACTCAAGCTCGTTGTCGCAAAGGTCGTTTTATTTCCGGGCATGCACATCCTGGTAGTCCTTATGTTTTTGGACTTGCTGGGGCTGATCATGATGTTGATTTCCCTGAACCAATCCCTGTAGTTGGAGTATCTCGTTCTGCAAAAGGGTATTGCATAACTTCAGTGCTGGAGACTATGAAGACATTTTCAGTGGATGATGGTCTTACTGAGGAGGCTATAGTAACCAAGCTACGCACTTCTCGATACCAACATTTATTTCTGCACACATCTCTGAAACACAACTCAGCGGGTTTCACTGCAGGTACTTCTCGGTGGGGAGAATTTGGTGAAGGGGGTATGTTGTGGGGAGAATGTACTGGCAAGCACTTTGAGTGGTTTGATGGTGATCCAATCACTGAGATTTTATTCAAGGTAAAGGAGATCTATGGTCTTGATCATGATGTTTCCTTTAGAGATGTCACTGTTTCTCCCGAGAAAAGGCCACGACCTTTGCACCTTGGAACAGCCACACAAGTTGGTGCCATACCTACAGAGGGAATACCCAGCTTGTTGAAAGTGCTGCTCCCTGCAAGTTGTGTTGGCCTTCCTGTACTGTATATAAGAGATCTTCTTCTTAATCCACCGGCATATGTGATTGCATCGGCAATTCAAGAAACATGCAAAATTATGAGTGGTGTAACGTGTTCGATCCCTGAGTTTACATGTGTGCCAGCTGCCAAGCTTGTGAAGCTATTGGAATCCAGAGAGGCAAATCATATTGAATTCTGCAGAATCAAGAACATAGCTGATGAAATCCTGCAGATGTATAAAAGCTTTGAGCTATGTGATATTCTAAAACTTCTAATGGATCCTACTTGGGTTGCCACTGGGTTGAAAGTTGAATTAAAGACCTTGGTAAAAGAGTGTGAATGGGTGTCAAATCGAATTGGTGAAGTGATTCTTCTGGATGGTGAAAGTGATCAAAAATTCAGTTCCTTTCTTGCAATTCCAAGTGAATTTTTTGTAGATATGGAATCTTCATGGAAAGGGCGTGTGAAGAGGATTCATGCTGAGGAAGCGTATGCAGAAGTGGAGAAGGCAGCTGAAGCCTTATCTATAGCAGTTATGGAAGATTTTCTTCCAATTATTTCAAGAATAAAAGCTACAGCAGCTCCCCTTGGGGGTCCCAAGGGAGAAGTATCATATGCCAGAGAACATGAAGCTGTTTGGTTTAAAGGAAAACGTTTTGCACCCACTGTTTGGGCTGGTACTCCTGGGGAACAAGAAATCAAACAGCTTAGACCTGCTACAGATTCAAAAGGGAGAAAGGTTGGAGAAGAATGGTTTACTACAAAGAAGGTGGAGGATGCGCTACTTAGATACCATGAAGCAGGTGATAAGGCAAAGGCTACAGTATTAGCATTATTGAGAGGACTTTCTGCTGAGTTACAGGACAAAATAAACATTCTTGTCTTTGCTTCTATGTTGCTTGTCATAGCAAAAGCACTATTTTCTCACGTCAGTGAGGGTAAAAGGAGGAAGTGGGTTTTTCCTACCCTTGTTGAGTTCCCTAAGAGTAAGGATAGAATATCATCACATGGGGCAAACAAAATGCAGATATTTGGTTTATCACCTTATTGGTTTGATATAGCACAAGGCAATGCAATACATAATACAGTTGACATGCAATCATTGTTTCTTTTGACCGGGCCAAATGGGGGTGGTAAATCTAGTTTGCTTCGATCAATTTGTGCAGCTGCATTACTTGGAATATGTGGATTGACGGTGCCTGCAGAGTCGGCACTTATTCCACATTTTGACTCTATTATGCTTCACATGAAATCTTATGATAGCCCTGCTGATGGAAAAAGCTCCTTTCAGATTGAAATGTCAGAGATTCGTTCCATAATAGCTGGGGCCACTGCAAGGAGCCTTGTTCTTGTTGATGAAATATGTAGGGGTACAGAAACAGCAAAAGGAACATGTATTGCTGGTAGCATTGTTGAGACACTTGATAACATTAGTTGCCTTGGTGTTGTTTCTACCCACTTGCATGGGATTTTTGATCTACCACTAAACACAAAGAACATTGTATATAAAGCCATGGGATCAGAGAACTTAAATGGTCATACACGGCCAACATGGAAATTGATAGATGGAATCTGTAGAGAAAGTCTTGCCTTTGAAACAGCCCAAGGGGAAGGCATCCCTGAAACAGTAATCCATAGAGCAAAAGAACTGTATCTTTCATTAAATGAAAAGGAGGATGCATCTTCAGGAAAAAGTGATGCAAAAGTGGAACATCTTAGTTCAGATTCTGATGAAGTCGAAGAGCAATTGCATAGGGTTAAGATAGGAGCTATTGGTATGAGGATGAAGGCATTGAATTCTGTAGAGATTCTACGAAAGGAAATAGCAAGTGCTGTTACCATAATCTGTCAGAAGAAACTGATAGAGTTATACAAACAGAGAAATATTTCAGAACTTACTGAGGTCAATTGTGTCATTATCTCTTCTAGGGAACAACCACCTCCATCAACTATAGGTGCTTCAAGTGTCTATGTGCTTCTGAGACCTGACAAGAAATTATATGTTGGACAGACGGATGACCTTGAGGGTAGAGTCCGTGCTCACCGTTCAAAGGAAGGGATGCAGAATGCTTCGTTCCTTTATGTTATAGTCCCAGGAAAGAGCATAGCTAGCCAACTGGAAACTCTATTAATTAACCAGCTTCCTCATCAAGGCTTTCGGCTCACAAACATTGCAGATGGAAAGCATCGTAACTTTGGCACATCCAGTCTCTCCTTAGAAAGTGTCGTCTTGTAA |
Protein: MSWCLKWMRHDRVGELCLWVLKVVQNMDAACRGQGDWIHCFKTERLSRGNVKATKKLKEAKPILEEKDHSHIMWWKERLQFFRKPSSIQLVKRLTYSNLLGVDDSLKNGSLKEGTLNWEMLQFKMRFPREVLLCRVGDFYEAIGIDACVLVEHAGLNPFGGLRSDSIPRAGCPVMNLRQTLDDLTRNGYSVCIVEEVQGPTQARCRKGRFISGHAHPGSPYVFGLAGADHDVDFPEPIPVVGVSRSAKGYCITSVLETMKTFSVDDGLTEEAIVTKLRTSRYQHLFLHTSLKHNSAGFTAGTSRWGEFGEGGMLWGECTGKHFEWFDGDPITEILFKVKEIYGLDHDVSFRDVTVSPEKRPRPLHLGTATQVGAIPTEGIPSLLKVLLPASCVGLPVLYIRDLLLNPPAYVIASAIQETCKIMSGVTCSIPEFTCVPAAKLVKLLESREANHIEFCRIKNIADEILQMYKSFELCDILKLLMDPTWVATGLKVELKTLVKECEWVSNRIGEVILLDGESDQKFSSFLAIPSEFFVDMESSWKGRVKRIHAEEAYAEVEKAAEALSIAVMEDFLPIISRIKATAAPLGGPKGEVSYAREHEAVWFKGKRFAPTVWAGTPGEQEIKQLRPATDSKGRKVGEEWFTTKKVEDALLRYHEAGDKAKATVLALLRGLSAELQDKINILVFASMLLVIAKALFSHVSEGKRRKWVFPTLVEFPKSKDRISSHGANKMQIFGLSPYWFDIAQGNAIHNTVDMQSLFLLTGPNGGGKSSLLRSICAAALLGICGLTVPAESALIPHFDSIMLHMKSYDSPADGKSSFQIEMSEIRSIIAGATARSLVLVDEICRGTETAKGTCIAGSIVETLDNISCLGVVSTHLHGIFDLPLNTKNIVYKAMGSENLNGHTRPTWKLIDGICRESLAFETAQGEGIPETVIHRAKELYLSLNEKEDASSGKSDAKVEHLSSDSDEVEEQLHRVKIGAIGMRMKALNSVEILRKEIASAVTIICQKKLIELYKQRNISELTEVNCVIISSREQPPPSTIGASSVYVLLRPDKKLYVGQTDDLEGRVRAHRSKEGMQNASFLYVIVPGKSIASQLETLLINQLPHQGFRLTNIADGKHRNFGTSSLSLESVVL |